Word-wise Hand-written Script Separation for Indian Postal automation

نویسندگان

  • K. Roy
  • U. Pal
چکیده

In a multi-lingual multi-script country like India, a postal document may contain words of two or more scripts. For recognition of this document it is necessary to separate different scripts from the document. In this paper, an automatic scheme for word-wise identification of hand-written Roman and Oriya scripts is proposed for Indian postal automation. In the proposed scheme, at first, document skew is corrected. Next, using a piecewise projection method the document is segmented into lines and then lines into words. Finally, using different features like, water reservoir concept based features, fractal dimension based features, topological features, scripts characteristics based features etc., a Neural Network (NN) classifier is used for word-wise script identification. For experiment we consider 2500 words and overall accuracy of 97.69% is obtained from the proposed identification scheme.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automation of Indian Postal Documents Written in Bangla and English

In this paper, we present a system towards Indian postal automation based on pin-code and city name recognition. Here, at first, using Run Length Smoothing Approach (RLSA), non-text blocks (postal stamp, postal seal, etc.) are detected and using positional information Destination Address Block (DAB) is identified from postal documents. Next, lines and words of the DAB are segmented. In India, t...

متن کامل

International Journal of Applied Science & Technology Research Excellence Vol. 1, Issue 1, Nov-Dec 2011, ISSN NO. 2250 – 2718 (Print), 2250 – 2726 (Online)

In this paper, we present a system towards Indian postal automation based on PIN (Postal Index Number) code. Since India is a multilingual and multi-script country that was earlier colonized by UK, the address part may be written by combination of scripts such as Latin (English) and a local (state) script. Here, we shall consider Oriya script one of the local state language in India with Englis...

متن کامل

Handwritten Devanagari Numeral Recognition by Fusion of Classifiers

The abstract is to Recognition of handwritten Devanagari numerals has many applications especially in the field of postal automation, document processing and so on. Due to its vast applications, many researchers are actively working towards development of effective and efficient hand written character/numeral recognition. Devanagari script is widely used script in Indian sub-continent, also dev...

متن کامل

A Survey on Devanagari Character Recognition for Indian Postal System Automation

The commercial and industrial applications, namely, business form reading, bank cheque reading, and full postal address reading, have constituted the majority of the market for handwriting recognition technology. Devanagari is the most popular script in India and is used to write texts in Hindi, Marathi and Nepali languages. Significant amount of work has been done on Devanagari isolated charac...

متن کامل

Cursive Script Postal Address Recognition Abstract Cursive Script Postal Address Recognition

Cursive Script Postal Address Recognition By Prasun Sinha Large variations in writing styles and di culty in segmenting cursive words are the main reasons for cursive script postal address recognition being a challenging task A scheme for locating and recognizing words based on over segmentation followed by dynamic programming is proposed This technique is being used for zip code extraction as ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006